SMML estimators for linear regression and tessellations of hyperbolic space

نویسنده

  • James G. Dowty
چکیده

The strict minimum message length (SMML) principle links data compression with inductive inference. The corresponding estimators have many useful properties but they can be hard to calculate. We investigate SMML estimators for linear regression models and we show that they have close connections to hyperbolic geometry. When equipped with the Fisher information metric, the linear regression model with p covariates and a sample size of n becomes a Riemannian manifold, and we show that this is isometric to (p+1)-dimensional hyperbolic space H equipped with a metric tensor which is 2n times the usual metric tensor on H. A natural identification then allows us to also view the set of sufficient statistics for the linear regression model as a hyperbolic space. We show that the partition of an SMML estimator corresponds to a tessellation of this hyperbolic space. 1 The linear regression model To establish our notation we briefly recall some details of the linear regression model. The linear regression model is a statistical model for observed data y ∈ R (thought of as a column matrix) which is a realization of an n-dimensional, normally-distributed random variable Y with mean Aβ and variance-covariance matrix σIn, i.e., Y ∼ Nn(Aβ, σIn), where A is a full-rank n× p matrix called the design matrix, β ∈ R is a column matrix, σ > 0 and In is the n × n identity matrix. Here β and σ are unknown and are to be estimated in terms of y and A. In this paper, we will always require p ≤ n though for certain results (indicated in the text) we will also require p < n. The probability density function (PDF) of Y given values of the unknown model parameters β and σ is therefore (2πσ) exp ( −‖y − Aβ‖ 2 2σ2 )

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Differenced-Based Double Shrinking in Partial Linear Models

Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...

متن کامل

SMML estimators for exponential families with continuous sufficient statistics

The minimum message length principle is an information theoretic criterion that links data compression with statistical inference. This paper studies the strict minimum message length (SMML) estimator for d-dimensional exponential families with continuous sufficient statistics, for all d. The partition of an SMML estimator is shown to consist of convex polytopes (i.e. convex polygons when d = 2...

متن کامل

Comparison Of Hyperbolic And Constant Width Simultaneous Confidence Bands in Multiple Linear Regression Under MVCS Criterion

‎A simultaneous confidence band gives useful information on the reasonable range of the unknown regression model‎. ‎In this note‎, ‎when the predictor variables are constrained to a special ellipsoidal region‎, ‎hyperbolic and constant width confidence bonds for a multiple linear regression model are compared under the minimum volome confidence set (MVCS) criterion‎. ‎The size of one speical an...

متن کامل

Truncated Linear Minimax Estimator of a Power of the Scale Parameter in a Lower- Bounded Parameter Space

&nbsp;Minimax estimation problems with restricted parameter space reached increasing interest within the last two decades&nbsp;Some authors derived minimax and admissible estimators of bounded parameters under squared error loss and scale invariant squared error loss&nbsp;In some truncated estimation problems&nbsp;the most natural estimator to be considered is the truncated version of a classic...

متن کامل

Hyperbolic Tessellations Associated to Bianchi Groups

Let F/Q be a number field. The space of positive definite binary Hermitian forms over F form an open cone in a real vector space. There is a natural decomposition of this cone into subcones. In the case of an imaginary quadratic field these subcones descend to hyperbolic space to give rise to tessellations of 3-dimensional hyperbolic space by ideal polytopes. We compute the structure of these p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1403.2201  شماره 

صفحات  -

تاریخ انتشار 2014